# Multilingual speech synthesis
Outetts 1.0 0.6B GGUF
Apache-2.0
OuteTTS-1.0-0.6B GGUF is a multilingual text-to-speech model that supports speech synthesis and cloning, providing efficient and accurate speech generation capabilities.
Speech Synthesis Supports Multiple Languages
O
Mungert
854
1
Llama OuteTTS 1.0 1B
OuteTTS 1.0 is a multilingual text-to-speech model based on the Llama architecture, supporting 20 languages with high-quality speech synthesis and voice cloning capabilities.
Speech Synthesis Supports Multiple Languages
L
unsloth
233
2
Llama OuteTTS 1.0 1B GPTQ 8bit
OuteTTS 1.0 is a 1B-parameter text-to-speech model supporting multilingual speech synthesis and voice cloning
Speech Synthesis Supports Multiple Languages
L
adriabama06
15
1
Voila Autonomous Preview
MIT
Voila is a large family of speech-language foundation models designed to enhance human-computer interaction, supporting real-time, low-latency voice interaction and multilingual processing.
Text-to-Audio
Transformers Supports Multiple Languages

V
maitrix-org
332
8
Voila Audio Alpha
MIT
Voila is a large family of speech-language foundation models designed to enhance human-computer interaction, supporting real-time, low-latency voice interaction and multilingual processing.
Text-to-Audio
Transformers Supports Multiple Languages

V
maitrix-org
175
3
Voila Chat
MIT
Voila is a brand-new large-scale speech-language foundation model series designed to elevate human-computer interaction to unprecedented levels.
Text-to-Audio
Transformers Supports Multiple Languages

V
maitrix-org
2,423
32
Voila Base
MIT
Voila is a brand-new family of large-scale speech-language foundation models designed to elevate human-computer interaction to new heights.
Speech Recognition
Transformers Supports Multiple Languages

V
maitrix-org
662
10
Voila Tokenizer
MIT
Voila is a large-scale voice-language foundation model series designed to enhance human-computer interaction, supporting multiple audio tasks and languages.
Text-to-Audio
Transformers Supports Multiple Languages

V
maitrix-org
4,912
3
XTTS V2 Argentinian Spanish
Other
ⓍTTS is a voice generation model that can clone a voice with just 6 seconds of audio and apply it to different languages, supporting Argentinian-accented Spanish.
Speech Synthesis Spanish
X
UNRN
16
1
XTTS V2 Argentinian Spanish
Other
ⓍTTS is a speech generation model that can clone voices with just 6 seconds of audio and apply them to different languages. No need for hours of extensive training data.
Speech Synthesis Spanish
X
marianbasti
44
5
Mms Tts Lav
Latvian text-to-speech model developed by Meta, based on VITS architecture, supporting high-quality speech synthesis
Speech Synthesis
Transformers

M
facebook
55
0
Mms Tts Uzb Script Cyrillic
Uzbek (Cyrillic script) text-to-speech model developed by Meta, based on VITS architecture, supporting high-quality speech synthesis
Speech Synthesis
Transformers

M
facebook
776
6
Mms Tts Urd Script Devanagari
Urdu text-to-speech model developed by Meta, supports Devanagari script transliterated text input to generate high-quality speech output
Speech Synthesis
Transformers

M
facebook
315
0
Speecht5 TTS Haitian
A Haitian Creole text-to-speech model fine-tuned based on the SpeechT5 architecture, trained using Carnegie Mellon University's Haitian language dataset
Speech Synthesis
Transformers Other

S
idajikuu
139
4
Bark Small
Bark is a Transformer-based text-to-audio model created by Suno, capable of generating highly realistic multilingual speech, music, background noise, and simple sound effects.
Speech Synthesis
Transformers Supports Multiple Languages

B
ylacombe
1,947
2
Tortoise Tts
Apache-2.0
TorToiSe is a text-to-speech program focused on multilingual capabilities and highly realistic prosody and intonation.
Speech Synthesis
Transformers English

T
Gatozu35
19
15
Featured Recommended AI Models